Performance evaluation methodology for document image dewarping techniques
Abstract
The performance evaluation of dewarping techniques is currently addressed either by relying on visually pleasing impressions or by using optical character recognition (OCR) as a means of indirect evaluation. In this study, the authors present a performance evaluation methodology that calculates a comprehensive evaluation measure reflecting the entire performance of a dewarping technique in a concise quantitative manner. The proposed evaluation measure takes into account the deviation of the dewarped text lines from a horizontal straight reference, which is considered to be the optimal result. This measure is expressed by the integral over the dewarped text line curves. To reduce the manual effort of identifying the text lines in the dewarped image, the authors propose a point-to-point matching procedure that finds the correspondence between the manually marked warped document image and its dewarped counterpart. This enables the evaluation of an unlimited number of dewarping methods using a marking procedure that is applied only once. The validity of the proposed performance evaluation methodology is demonstrated by concise experimental work that comprises four state-of-the-art dewarping techniques as well as the involvement of different users in the interactive part of the procedure.
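To make the idea of an integral-based deviation measure concrete, here is a minimal sketch. It is not the authors' exact formula, which the abstract does not give. It assumes each dewarped text line is available as sampled (x, y) points, takes the horizontal reference to be the line at the curve's mean height, and normalizes the area by the line width. The function names `line_deviation` and `page_deviation` are hypothetical.

```python
import numpy as np


def line_deviation(xs, ys):
    """Mean absolute deviation of one text-line curve from a horizontal reference.

    Assumption (not from the paper): the reference is the horizontal line at
    the curve's mean height, and the integral is approximated with the
    trapezoidal rule over the sampled points.
    """
    xs = np.asarray(xs, dtype=float)
    ys = np.asarray(ys, dtype=float)
    order = np.argsort(xs)              # integrate from left to right
    xs, ys = xs[order], ys[order]

    width = xs[-1] - xs[0]
    if width <= 0:
        raise ValueError("need at least two points with distinct x values")

    dx = np.diff(xs)
    # Mean height of the curve (trapezoidal integral of y divided by width).
    y_ref = np.sum(dx * (ys[:-1] + ys[1:]) / 2.0) / width
    # Area between the curve and the horizontal reference.
    dev = np.abs(ys - y_ref)
    area = np.sum(dx * (dev[:-1] + dev[1:]) / 2.0)
    return area / width


def page_deviation(lines):
    """Average the per-line deviation over all marked text lines of a page."""
    return float(np.mean([line_deviation(xs, ys) for xs, ys in lines]))
```

Under these assumptions a perfectly horizontal line scores 0, and larger values indicate text lines that remain curved or slanted after dewarping, so lower is better.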
Related resources
Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by a scanner or digital camera usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
An Image Based Performance Evaluation Method for Page Dewarping Algorithms Using SIFT Features
Dewarping of camera-captured document images is one of the important preprocessing steps before feeding them to a document analysis system. Over the last few years, many approaches have been proposed for document image dewarping. Usually optical character recognition (OCR) based and/or feature based approaches are used for the evaluation of dewarping algorithms. OCR based evaluation is a good meas...
Dewarping of Document Images using Coupled-Snakes
Traditional OCR systems are designed for planar (dewarped) images, and their accuracy is reduced when they are applied to warped images. Therefore, developing new OCR techniques for warped images or developing dewarping techniques are the possible solutions for improving OCR accuracy on camera-captured documents. Among different types of dewarping techniques, curled textlines information based dewarping techn...
Performance Evaluation of Curled Textlines Segmentation Algorithms
Curled textlines segmentation is a necessary initial step for hand-held camera-captured document image processing. Curled textlines information is often used as an intermediate step for camera-captured document image dewarping. Curled textlines information can also be used for other camera-based document image processing tasks, such as layout analysis. So far no work has been done for the ...
Document Image Dewarping Contest
Dewarping of documents captured with hand-held cameras in an uncontrolled environment has triggered a lot of interest in the scientific community over the last few years and many approaches have been proposed. However, there has been no comparative evaluation of different dewarping techniques so far. In an attempt to fill this gap, we have organized a page dewarping contest along with CBDAR 200...